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Pursuant to 3 5 U.S.C. §2 02 (c) , it is 
acknowledged that the U.S. Government has certain rights 
in the invention described herein, which was made in part 
with funds from the United States Department of 
5 Agriculture, Grant Nos . 58-3 655-3-107 and 58-3 655-4-14 0, 
and from the National Institutes of Health, Grant No. 
GM33716-08S1 . 



FIELD OF THE INVENTION 

10 This invention relates to the field of disease 

resistance in plants. In particular, the invention 
provides a novel avirulence gene from the rice blast 
pathogen, Magnaporthe grisea, and methods of using the 
gene and its encoded products for improving resistance of 

15 rice to this pathogen. 

BACKGROUND OF THE INVENTION 

Rice is a major staple food for about two- 
thirds of the world's population. More than ninety 

2 0 percent of the world's rice is grown and consumed in 

developing countries. Rice blast disease, caused by the 
fungus Magnaporthe grisea, threatens rice crops 
worldwide. The disease can cause yield losses of ten to 
thirty percent in infested fields. Rice blast has been 

2 5 an ongoing problem in rice growing areas of the southern 
United States. It has now become a significant problem 
in rice growing areas of California, as well. 

The "gene- f or-gene" hypothesis has been advanced 
to explain the very specific disease resistance / 

30 susceptibility relationship that often exists between 
races of a plant pathogen and cultivars of its host 
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species. The gene-f or-gene hypothesis has been found 
applicable to many host -pathogen interactions, including 
that of the rice blast fungus, Magnaporthe grisea, and its 
host, Oryza sativa. To be able to understand and 
5 manipulate this host-pathogen relationship is of great 
practical interest as M. grisea is rapidly able to 
overcome new disease resistance in rice soon after their 
deployment. Moreover, M. grisea exists as a complex genus 
with many subspecific groups that are infertile, but 

10 differ in their host range. How these different 

subspecific groups interrelate evolutionarily is of great 
concern to plant breeders since some of these alternate 
hosts are frequently found growing in close proximity to, 
or in rotation with rice, and M. grisea isolates infecting 

15 these alternate hosts can sometimes also infect rice. 

Gene-f or-gene resistance (also known as 
hypersensitive resistance (HR) or race - specif ic 
resistance) depends for its activation on specific 
recognition of the invading pathogen by the plant. Many 

2 0 individual plant genes have been identified that control 
gene-f or-gene resistance. These genes are referred to as 
resistance (R) genes. The function of a particular R 
gene depends on the genotype of the pathogen. A pathogen 
gene is referred to as an Avr gene if its expression 

2 5 causes the pathogen to produce a signal that triggers a 

strong defense response in a plant having a corresponding 
R gene. This response is not observed in the absence of 
either the Avr gene in the pathogen or the corresponding 
R gene in the plant. It should be noted that a single 

3 0 plant may have many R genes, and a single pathogen may 

have many Avr genes. However, strong resistance occurs 
only when an Avr gene and its specific R gene are matched 
in a host-pathogen interaction. In this instance, 



resistance generally occurs as activation of a HR 
response, in which the cells in the immediate vicinity of 
the infection undergo programmed necrosis in order to 
prevent the further advance of the pathogen into living 
5 plant tissue. Other features of the resistance response 
may also include synthesis of antimicrobial metabolites 
or pathogen- inhibiting enzymes, reinforcement of plant 
cell walls in the infected area, and induction of signal 
transduction pathways leading to systemic acquired 

10 resistance (SAR) in the plant. 

The molecular basis of host -cult ivar 
specificity and pathogenic variability in M. grisea is 
only beginning to be elucidated with the identification, 
mapping and, in some instances, cloning of specific Avr 

15 genes from pathogenic isolates of M. grisea. For 

instance, AVR2 -YAMO (cultivar specificity) and PWL2 (host 
specificity) (Valent & Chumley, pp. 3.113 - 3.134 in Rice 
Blast Disease (R. Zeigler, S.A. Leong, P. Teng, Eds.), 
Wallingford: CAB International, 1994) both function as 

20 classic avirulence genes by preventing infection of a 

specific cultivar or host. AVR2-YAM0 encodes a 223 -amino 
acid protein with homology to proteases, while PWL2 
encodes a 145 -amino acid polypeptide which is glycine- 
rich. Based on the predicted amino acid sequences of the 

2 5 proteins, both may be secreted. 

Homologs of both AVR2-YAMO and PWL2 appear to 
be widely distributed in rice and in other grass- 
infecting isolates of M. grisea., thereby confirming that 
M. grisea isolates which do not infect rice still may 

30 carry host or cultivar specificity genes for rice. In 
some cases, homologs of AVR2 - YAMO and PWL2 have been 
shown to be functional and to exhibit the same host or 
cultivar specificity as AVR2 - YAMO or PWL2 . 



As another example of a potentially useful Avr 
gene, the cultivar specificity gene AVR1 -C039, which 
determines avirulence on rice cultivar C03 9, has been 
identified (Valent et al . , Genetics 127 : 87-101, 1991) 
5 and mapped to a position on M. grisea chromosome 1 (Smith 
Sc Leong, Theor . Appl . Genet. 88: 901-908, 1994). A 
segment of chromosome 1 that appears to contain the AVR1- 
C039 gene has been isolated and cloned into a cosmid 
vector (Leong et al . , pp. 846-852 in Rice Genetics III, 

10 Proceedings of the Third Annual Rice Genetics Symposium , 
G.S. Khush, Ed., Island Harbor Press, Manila, 1996); 
however, the gene itself heretofore has not been 
identified and characterized. 

The availability of cloned cultivar and host 

15 specificity genes from M. grisea and, ultimately, the 

corresponding R genes from rice provides useful tools for 
manipulating and augmenting resistance to this pathogen 
in the field. Accordingly, it is an object of the 
present invention to provide a new cloned M. grisea 

2 0 cultivar specificity gene, AVR1-C039 , and its functional 

homologs for such use. 

SUMMARY OF THE INVENTION 

According to one aspect of the invention, there 
25 is provided an isolated nucleic acid, AVR1-C039, from 
Magnaporthe grisea that confers rice cultivar C03 9- 
specific avirulence to fungal plant pathogens that 
contain the nucleic acid. The nucleic acid preferably 
comprises part or all of Sequence I.D. No. 1, or 

3 0 hybridizes with part or all of Sequence I.D. No. 1 or its 

complement . 

According to another aspect of the invention, 
there is provided a polypeptide encoded by part or all of 



the isolated nucleic acid of claim 1. Preferably, the 
polypeptide is selected from the group of polypeptides 
encoded by ORFS 1, 2, 3, 4, 5, 6 and 7, corresponding to 
Sequence ID No's. 2, 3, 4, 5, 6, 7 and 8, respectively, 
5 and most preferably is encoded by ORF 3 . 

According to another aspect of the invention, a 
transgenic epiphytic bacterium is provided, which 
expresses a portion of an AVR1-C039 gene effective to 
confer rice cultivar C039-specif ic avirulence to 

10 microorganisms expressing the gene. Preferably, the 

transgenic epiphytic bacterium expresses ORF3 of Sequence 
ID No . 1, or a functional equivalent. 

According to another aspect of the invention, a 
method of enhancing the scope of resistance of rice 

15 cultivar C039 plants to pathogenic microorganisms is 

provided. The method comprises treating the plants with 
an epiphytic bacterium that expresses a portion of an 
AVR1-C039 gene effective to trigger expression of a C039- 
specific R gene in the plants. 

2 0 According to another aspect of the invention, a 

second method of enhancing the scope of resistance of 
rice cultivar C039 plants to pathogenic microorganisms is 
provided. This method comprises treating the plants with 
a protein extract comprising polypeptides produced by 
25 expression of AVR1-C039 , in an amount effective to 
trigger expression of a C03 9-specif ic R gene in the 
plants . 

These and other features and advantages of the 
present invention will be described in greater detail in 

3 0 the description and examples set forth below. 



DETAILED DESCRIPTION OF THE INVENTION 
I . Definitions 

Various terms relating to the biological 
molecules of the present invention are used hereinabove 
and also throughout the specifications and claims. The 
terms "substantially the same," "percent similarity" and 
"percent identity" are defined in detail below. 

With reference to nucleic acids of the 
invention, the term "isolated nucleic acid" is sometimes 
used. This term, when applied to DNA, refers to a DNA 
molecule that is separated from sequences with which it 
is immediately contiguous (in the 5' and 3' directions) 
in the naturally occurring genome of the organism from 
which it was derived. For example, the "isolated nucleic 
acid" may comprise a DNA molecule inserted into a vector, 
such as a plasmid or virus vector, or integrated into the 
genomic DNA of a procaryote or eucaryote . An "isolated 
nucleic acid molecule" may also comprise a cDNA molecule. 

With respect to RNA molecules of the invention 
the term "isolated nucleic acid" primarily refers to an 
RNA molecule encoded by an isolated DNA molecule as 
defined above. Alternatively, the term may refer to an 
RNA molecule that has been sufficiently separated from 
RNA molecules with which it would be associated in its 
natural state (i.e., in cells or tissues), such that it 
exists in a "substantially pure" form (the term 
"substantially pure" is defined below) . 

With respect to protein, the term "isolated 
protein" or "isolated and purified protein" is sometimes 
used herein. This term refers primarily to a protein 
produced by expression of an isolated nucleic acid 
molecule of the invention. Alternatively, this term may 
refer to a protein which has been sufficiently separated 



from other proteins with which it would naturally be 
associated, so as to exist in "substantially pure" form. 

The term "substantially pure" refers to a 
preparation comprising at least 50-60% by weight the 
compound of interest (e.g., nucleic acid, 
oligonucleotide, protein, etc.). More preferably, the 
preparation comprises at least 75% by weight, and most 
preferably 90-99% by weight, the compound of interest. 
Purity is measured by methods appropriate for the 
compound of interest (e.g. chromatographic methods, 
agarose or polyacrylamide gel electrophoresis, HPLC 
analysis, and the like) . 

With respect to antibodies of the invention, 
the term "immunologically specific" refers to antibodies 
that bind to one or more epitopes of a protein of 
interest, but which do not substantially recognize and 
bind other molecules in a sample containing a mixed 
population of antigenic biological molecules. 

With respect to oligonucleotides, the term 
"specifically hybridizing" refers to the association 
between two single-stranded nucleotide molecules of 
sufficiently complementary sequence to permit such 
hybridization under pre -determined conditions generally 
used in the art (sometimes termed "substantially 
complementary") . In particular, the term refers to 
hybridization of an oligonucleotide with a substantially 
complementary sequence contained within a single-stranded 
DNA or RNA molecule of the invention, to the substantial 
exclusion of hybridization of the oligonucleotide with 
single-stranded nucleic acids of non- complementary 
sequence . 

The term "pathogen- inoculated" refers to the 
inoculation of a plant with a pathogen. 
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The term "disease defense response" refers to a 
change in metabolism, biosynthetic activity or gene 
expression that enhances the plant's ability to suppress 
the replication and spread of a microbial pathogen (i.e., 
5 to resist the microbial pathogen) . Examples of plant 

disease defense responses include, but are not limited 
to, production of low molecular weight compounds with 
antimicrobial activity (referred to as phytoalexins ) and 
induction of expression of defense (or defense-related) 

10 genes, whose products include, for example, peroxidases, 
cell wall proteins, proteinase inhibitors, hydrolytic 
enzymes, pathogenesis-related (PR) proteins and 
phytoalexin biosynthetic enzymes, such as phenylalanine 
ammonia lyase and chalcone synthase. Such defense 

15 responses appear to be induced in plants by several 

signal transduction pathways involving secondary defense 
signaling molecules produced in plants. Agents that 
induce disease defense responses in plants include, but 
are not limited to: (1) microbial pathogens, such as 

2 0 fungi, bacteria and viruses; (2) microbial components and 
other defense response elicitors, such as proteins and 
protein fragments, small peptides, (3-glucans, elicitins 
and harpins, cryptogein and oligosaccharides; and (3) 
secondary defense signaling molecules produced by the 

2 5 plant, such as salicylic acid, H 2 0 2 , ethylene and 

j asmonates . 

The term "promoter region" refers to the 5 ' 
regulatory regions of a gene. 

The term "reporter gene" refers to genetic 

3 0 sequences which may be operably linked to a promoter 

region forming a transgene, such that expression of the 
reporter gene coding region is regulated by the promoter 
and expression of the transgene is readily assayed. 
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The term "selectable marker gene" refers to a 
gene product that when expressed confers a selectable 
phenotype, such as antibiotic resistance, on a 
transformed cell or plant. 
5 The term "operably linked" means that the 

regulatory sequences necessary for expression of the 
coding sequence are placed in the DNA molecule in the 
appropriate positions relative to the coding sequence so 
as to effect expression of the coding sequence. This 

10 same definition is sometimes applied to the arrangement 
of coding sequences and transcription control elements 
(e.g. promoters, enhancers, and termination elements) in 
an expression vector. 

The term "DNA construct" refers to genetic 

15 sequence used to transform plants and generate progeny 

transgenic plants. These constructs may be administered 
to plants in a viral or plasmid vector. Other methods of 
delivery such as Agrobacterium T-DNA mediated 
transformation and transformation using the biolistic. 

20 process are also contemplated to be within the scope of 
the present invention. The transforming DNA may be 
prepared according to standard protocols such as those 
set forth in "Current Protocols in Molecular Biology", 
eds. Frederick M. Ausubel et al . , John Wiley & Sons, 

25 1998. 

II. Description of AVR1-C039 
and its Encoded Peptides 

30 In accordance with the present invention, a 

novel Magnaporthe grisea avirulence gene has been 
isolated and cloned. This gene is referred to herein as 
AVR1-C039, to denote its function as a gene that confers 
cultivar-specif ic interactions with rice cultivar C039. 



The cloning of an AVR1-C039 gene from M. grisea strain 
2539 and analysis of the gene are described in detail in 
Example 1. The gene contains four open reading frames, 
one of which (0RF3) appears to play the most key role in 
conferring cultivar specific avirulence to Magnaporthe 
isolates that carry the gene. Homologs of the strain 
2 53 9 isolate AVR1C039 gene have been identified in a 
diverse array of other Magnaporthe isolates. 

A genomic clone of AVR1-C039 from M. grisea 
strain 253 9, an exemplary AVR1-C039 of the invention, is 
described in detail herein and its nucleotide sequence is 
set forth in Example 1 as Sequence I.D. No. 1. Sequence 
I.D. No. 1 contains four open reading frames. It is 
believed that one or more of these open reading frames 
are responsible for conferring avirulence on cultivar 
C03 9, either by virtue of the gene product expressed from 
the open reading frame or by possession of critical 
transcription or translation regulatory sequences (see 
Example 1) . 

Although a genomic clone of AVR1-C039 from M . 
grisea isolate 2539 is described and exemplified herein, 
this invention is intended to encompass nucleic acid 
sequences and proteins from other Magnaporthe isolates 
that are sufficiently similar to be used instead of the 
isolate 2539 AVR1-C039 nucleic acid and proteins for the 
purposes described below. These include, but are not 
limited to, allelic variants and natural mutants of 
Sequence I.D. No. 1, which are likely to be found in any 
given population of Magnaporthe isolates. Because such 
variants are expected to possess certain differences in 
nucleotide and amino acid sequence, this invention 
provides an isolated AVR1 -C039 nucleic acid molecule 
having at least about 60% (preferably 70% and more 



preferably over 80%) sequence homology in the coding 
regions with the nucleotide sequence set forth as 
Sequence I.D. No. 1 (and, most preferably, specifically 
comprising the coding region of sequence I.D. No. 1). 
This invention also provides isolated polypeptide 
products of the open reading frames of Sequence I.D. No. 
1, having at least about 60% (preferably 70% or 80% or 
greater) sequence homology with the amino acid sequences 
of Sequence I.D. No's. 2, 3, 4, 5, 6 or 7, respectively. 
Because of the natural sequence variation likely to exist 
among AVR1-C039 genes, one skilled in the art would 
expect to find up to about 30-40% nucleotide sequence 
variation, while still maintaining the unique properties 
of the AVR1-C039 gene and encoded polypeptides of the 
present invention. Such an expectation is due in part to 
the degeneracy of the genetic code, as well as to the 
known evolutionary success of conservative amino acid 
sequence variations, which do not appreciably alter the 
nature of the encoded protein. Accordingly, such 
variants are considered substantially the same as one 
another and are included within the scope of the present 
invention . 

For purposes of this invention, the term 
"substantially the same" refers to nucleic acid or amino 
acid sequences having sequence variation that do not 
materially affect the nature of the protein (i.e. its 
structure and/or biological activity) . With particular 
reference to nucleic acid sequences, the term 
"substantially the same" is intended to refer to coding 
regions and to conserved sequences governing expression, 
and refers primarily to degenerate codons encoding the 
same amino acid, or alternate codons encoding 
conservative substitute amino acids in the encoded 



polypeptide. With reference to amino acid sequences, the 
term "substantially the same" refers generally to 
conservative substitutions and/or variations in regions 
of the polypeptide that do not affect structure or 
function. The terms "percent identity" and "percent 
similarity" are also used herein in comparisons among 
amino acid sequences. These terms are intended to be 
defined as they are in the UWGCG sequence analysis 
program (Devereaux et al . , Nucl . Acids Res. 12: 3 87-3 97, 
1984) , available from the University of Wisconsin. 

The following description sets forth the 
general procedures involved in practicing the present 
invention. To the extent that specific materials are 
mentioned, it is merely for purposes of illustration and 
is not intended to limit the invention. Unless otherwise 
specified, general cloning procedures, such as those set 
forth in Sambrook et al . , Molecular Cloning, Cold Spring 
Harbor Laboratory (1989) (hereinafter "Sambrook et al . " ) 
or Ausubel et al . (eds) Current Protocols i n Molecular 
Biology , John Wiley & Sons (1998) (hereinafter "Ausubel 
et al . " ) are used. 

A. Preparation of AVR1-C039 Nucleic 

Acid Molecules, encoded Polypeptides and 
Antibodies Specific for th e Polypeptides 

1 . Nucleic Acid Molecules 

AVR1-C039 nucleic acid molecules of the 
invention may be prepared by two general methods: (1) 
they may be synthesized from appropriate nucleotide 
triphosphates, or (2) they may be isolated from 
biological sources. Both methods utilize protocols well 
known in the art . 

The availability of nucleotide sequence 
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information, such as the genomic isolate having Sequence 
I.D. No. 1, enables preparation of an isolated nucleic 
acid molecule of the invention by oligonucleotide 
synthesis. Synthetic oligonucleotides may be prepared by 
5 the phosphoramadite method employed in the Applied 

Biosystems 38A DNA Synthesizer or similar devices. The 
resultant construct may be purified according to methods 
known in the art, such as high performance liquid 
chromatography (HPLC) . Long, double -stranded 

10 polynucleotides, such as a DNA molecule of the present 

invention, must be synthesized in stages, due to the size 
limitations inherent in current oligonucleotide synthetic 
methods. Thus, for example, a 1.05 kb double-stranded 
molecule may be synthesized as several smaller segments 

15 of appropriate complementarity. Complementary segments 
thus produced may be annealed such that each segment 
possesses appropriate cohesive termini for attachment of 
an adjacent segment. Adjacent segments may be ligated by 
annealing cohesive termini in the presence of DNA ligase 

20 to construct an entire 1.05 kb double-stranded molecule. 
A synthetic DNA molecule so constructed may then be 
cloned and amplified in an appropriate vector. 

AVR1-C039 genes also may be isolated from 
appropriate biological sources using methods known in the 

25 art. In one embodiment, a genomic clone has been 

isolated from a M. grisea strain 253 9 cosmid library. In 
an alternative embodiment, a cDNA clone comprising one or 
more of the open reading frames of the genomic AVR1-C03 9 
locus may be isolated. 

3 0 In accordance with the present invention, 

nucleic acids having the appropriate level sequence 
homology with part or all the coding regions of Sequence 
I.D. No. 1 may be identified by using hybridization and 
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washing conditions of appropriate stringency. For 
example, hybridizations may be performed, according to 
the method of Sambrook et al . , using a hybridization 
solution comprising: 5X SSC, 5X Denhardt 1 s reagent, 1.0% 
5 SDS, 100 jug/ml denatured, fragmented salmon sperm DNA, 
0.05% sodium pyrophosphate and up to 50% formamide. 
Hybridization is carried out at 37-42oc for at least six 
hours. Following hybridization, filters are washed as 
follows: (1) 5 minutes at room temperature in 2X SSC and 

10 1% SDS; (2) 15 minutes at room temperature in 2X SSC and 
0.1% SDS; (3) 30 minutes-1 hour at 37oC in 2X SSC and 
0.1% SDS; (4) 2 hours at 45-55oin 2X SSC and 0.1% SDS, 
changing the solution every 30 minutes. Alternatively, a 
modification of the Amasino hybridization protocol (Anal. 

15 Biochem. 152 : 304-307) is preferred for use in the 

present invention and is described in greater detail in 
Example 1 . 

One common formula for calculating the 
stringency conditions required to achieve hybridization 
20 between nucleic acid molecules of a specified sequence 
homology (Sambrook et al . , 198 9) : 

T m = 81.5°C + 16.6Log [Na+] + 0.4 1(% G+C) - 0.63 (% formamide) - 600/#bp in duplex 

25 As an illustration of the above formula, using [N+] = 

[0.368] and 50% formamide, with GC content of 42% and an 
average probe size of 200 bases, the T m is 57°C. The T m 
of a DNA duplex decreases by 1 - 1.5°C with every 1% 
decrease in homology. Thus, targets with greater than 

3 0 about 75% sequence identity would be observed using a 
hybridization temperature of 42°C. 

Nucleic acids of the present invention may be 
maintained as DNA in any convenient cloning vector. In a 
preferred embodiment, clones are maintained in plasmid 
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cloning/expression vector, such as pGEM-T (Promega 
Biotech, Madison, WI) or pBluescript (Stratagene, La 
Jolla, CA) , either of which is propagated in a suitable 
E. coli host cell . 
5 AVR1-C039 nucleic acid molecules of the 

invention include cDNA, genomic DNA, RNA, and fragments 
thereof which may be single- or doubl e- stranded . Thus, 
this invention provides oligonucleotides (sense or 
antisense strands of DNA or RNA) having sequences capable 

10 of hybridizing with at least one sequence of a nucleic 

acid molecule of the present invention, such as selected 
segments of the cDNA having Sequence I.D. No. 1. Such 
oligonucleotides are useful as probes for detecting A VR1 - 
C039 genes or mRNA in test samples of fungal isolates, 

15 e.g. by PCR amplification, or for the positive or 

negative regulation of expression of AVR1-C039 genes at 
or before translation of the mRNA into proteins. 



20 The AVR1-C039 genomic isolate described herein 

contains four open reading frames (ORF's 1-4), whose 
deduced amino acid sequences are set forth herein as 
Sequence I.D. No's. 2-5, respectively. Any one of these 
polypeptides may be prepared in a variety of ways, 

2 5 according to known methods. If produced in situ the 

polypeptides may be purified from appropriate sources, 
e.g., fungal isolates. 

Alternatively, the availability of nucleic acid 
molecules encoding the polypeptides enables production of 

3 0 the proteins using in vitro expression methods known in 

the art. For example, a cDNA or gene may be cloned into 
an appropriate in vitro transcription vector, such a 
pSP64 or pSP65 for in vitro transcription, followed by 
cell-free translation in a suitable cell-free translation 
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system, such as wheat germ or rabbit reticulocytes. In 
vitro transcription and translation systems are 
commercially available, e.g., from Promega Biotech, 
Madison, Wisconsin or BRL, Rockville, Maryland. 
5 According to a preferred embodiment, larger 

quantities of AVR1 -C03 9- encoded polypeptides may be 
produced by expression in a suitable procaryotic or 
eucaryotic system. For example, part or all of a DNA 
molecule, such as the cDNA having Sequence I.D. No. 1, 

10 may be inserted into a plasmid vector adapted for 

expression in a bacterial cell (such as E . coli) or a 
yeast cell (such as Saccharomyces cerevisiae) , or into a 
baculovirus vector for expression in an insect cell. 
Such vectors comprise the regulatory elements necessary 

15 for expression of the DNA in the host cell, positioned in 
such a manner as to permit expression of the DNA in the 
host cell. Such regulatory elements required for 
expression include promoter sequences, transcription 
initiation sequences and, optionally, enhancer sequences. 

20 The AVR1-C039 polypeptide (s) produced by gene 

expression in a recombinant procaryotic or eucyarotic 
system may be purified according to methods known in the 
art. In a preferred embodiment, a commercially available 
expression/secretion system can be used, whereby the 

2 5 recombinant protein is expressed and thereafter secreted 

from the host cell, to be easily purified from the 
surrounding medium. If expression/secretion vectors are 
not used, an alternative approach involves purifying the 
recombinant protein by affinity separation, such as by 

3 0 immunological interaction with antibodies that bind 

specifically to the recombinant protein. Such methods 
are commonly used by skilled practitioners. 

The A VR1-C03 9 -encoded polypeptides of the 
invention, prepared by the aforementioned methods, may be 
3 5 analyzed according to standard procedures. Methods for 
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analyzing the functional activity, i.e. ability to confer 
avirulence, are described in Example 1. 

The present invention also provides antibodies 
capable of immunospecif ically binding to polypeptides of 
5 the invention. Polyclonal or monoclonal antibodies 

directed toward any of the peptides encoded by the ORFs 
of AVR1-C039 may be prepared according to standard 
methods. Monoclonal antibodies may be prepared according 
to general methods of Kohler and Milstein, following 

10 standard protocols. In a preferred embodiment, 

antibodies are prepared, which react immunospecif ically 
with various epitopes of the AVR1 -C03 9- encoded 
polypeptides . 

Polyclonal or monoclonal antibodies that 

15 immunospecif ically interact with one or more of the 

polypeptides encoded by AVR1-C039 can be utilized for 
identifying and purifying such proteins. For example, 
antibodies may be utilized for affinity separation of 
proteins with which they immunospecif ically interact. 

2 0 Antibodies may also be used to immunoprecipitate proteins 
from a sample containing a mixture of proteins and other 
biological molecules. Other uses of the antibodies are 
described below. 

2 5 B. Uses of AVR1-C039 Nucleic Acids, 

Encoded Proteins and Antibodies 

The potential of recombinant genetic 
engineering methods to enhance disease resistance in 
agronomical ly important plants has received considerable 
30 attention in recent years. Protocols are currently 
available for the stable introduction of genes into 
plants, as well as for augmentation of gene expression. 
The present invention provides nucleic acid molecules 
which, upon stable introduction into a recipient plant, 

3 5 or into an epiphytic microorganism, can enhance the 
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plant's ability to resist pathogen attack. AVR1-C039- 
encoded proteins of the invention may also be applied 
directly to a plant, to induce a disease defense 
response . 

5 

1. AVR1-CQ39 Nucleic Acids 

AVR1-C039 nucleic acids (genomic clones or 
cDNAs) may be used for a variety of purposes in 
accordance with the present invention. The DNA, RNA, or 

10 fragments thereof may be used as probes to detect the 
presence of and/or expression of AVR1-C039 genes. 
Methods in which AVR1-C039 nucleic acids may be utilized 
as probes for such assays include, but are not limited 
to: (1) in situ hybridization; (2) Southern hybridization 

15 (3) northern hybridization; and (4) assorted 

amplification reactions such as polymerase chain 
reactions (PCR) . The AVR1-C03 9 nucleic acids of the 
invention may also be utilized as probes to identify 
homologs from other Ma.gn.apor the isolates. As described 

2 0 above, AVR1-C039 nucleic acids are also used to advantage 

to produce large quantities of substantially pure AVR1- 
C039 proteins, or selected portions thereof. 

Of perhaps greater significance, however, is 
the use of AVR1-C039 nucleic acids to broaden the scope 
25 of resistance of rice cultivars carrying the C039 
resistance gene to pathogens other than M. grisea 
isolates carrying the AVR1-C039 avirulence gene. For 
instance, in one embodiment of the invention, the AVR1 - 
C03 9 coding region is operably linked to a heterologous 

3 0 promoter, preferably one that is either generally 

pathogen inducible (i.e. inducible upon challenge by a 
broad range of pathogens) or wound inducible. Such 
promoters include, but are not limited to: 

a) promoters of genes encoding lipoxygenases 
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(preferably from plants, most preferably from rice, e.g., 
Peng et al . , J. Biol. Chem. 269: 3755-3761, 1994; Peng et 
al . , Abstract presented at the general meeting of the 
International Program on Rice Biotechnology, Malacca, 
5 Malaysia, Sept. 15-19, 1997); 

b) promoters of genes encoding peroxidases 
(preferably from plants, most preferably from rice, e.g., 
Chittoor et al . , Mol . Plant -Microbe Interactions 10.: 861- 
871, 1997) ; 

10 c) promoters of genes encoding 

hydroxymethylglutaryl-CoA reductase (preferably from 
plants, most preferably from rice, e.g., Nelson et al . , 
Plant Mol. Biol. 25: 401-412, 1994); 

d) promoters of genes encoding phenylalanine 
15 ammonia lyase (preferably from rice; e.g., Lamb et al . , 

Abstract of the general meeting of the International 
Program on Rice Biotechnology, Malacca, Malaysia, Sept. 
15-19, 1997) 

e) promoters of genes encoding glutathione-S- 

2 0 transferase (preferably from plants, most preferably from 
rice, or alternatively, the PRP1 promoter from potato) ; 

f) promoters from pollen- specif ic genes, such 
as corn Zmgl3 , which has been show to be expressed in 
rice transgenic pollen carrying the corn gene (Aldemita 

25 et al . , Abstract of the general meeting of the 

International Program on Rice Biotechnology, Malacca, 
Malaysia, Sept. 15-19, 1997); 

g) promoters from genes encoding chitinases 
(preferably from plants, most preferably from rice; e.g., 

30 Zhu & Lamb, Mol. Gen. Genet. 226 : 289-296, 1991); 

h) promoters from genes induced early (within 5 
hours post-inoculation) in the interaction of M. grisea. 
and rice (e.g., Bhargava & Hamer; Abstract B-10, 8th 
International Congress Molecular Plant Microbe 

35 Interactions, Knoxville, TN July 14-19, 1996) ; 
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i) promoters from plant (preferably rice) viral 
genes, either contained on a bacterial plasmid or on a 
plant viral vector, as described by Hammond- Kosack et 
al., Mol. Plant -Microbe Interactions 8.: 181-185 (1994); 
5 j ) promoters from genes involved in the plant 

(preferably rice) respiratory burst (e.g.. Groom et al . , 
Plant J. 10 (3) : 515-522, 1996); and 

k) promoters from plant (preferably rice) 
anthocyanin pathway genes (e.g., Reddy, pp 341-352 in 

10 Rice Genetics III , supra; Reddy et al . , Abstract of the 
general meeting of the International Program on Rice 
Biotechnology, Malacca, Malaysia, Sept. 15-19, 1997). 

The chimeric gene is then used to transform 
rice cultivars that already carry the appropriate R gene. 

15 Upon wounding or challenge with a plant pathogen, the 

resulting transgenic plants would be induced to produce 
the AVR1 -CO 3 9 gene product, thereby triggering the R gene 
defense response. In this embodiment, care must be taken 
to avoid using a promoter that is induced by necrosis, 

20 since use of such a promoter could result in a self- 
perpetuating hypersensitive response that may be lethal 
to the plant (see, e.g., Kim et al . , Proc . Natl. Acad. 
Sci. USA 91: 10445-10449, 1994). 

In a preferred embodiment, a coding region of 

25 AVR1-C039 (preferably the coding region corresponding to 
0RF3) is inserted into an expression vector in a 
microorganism that grows epiphytically on rice plants. A 
suspension of such recombinant microorganisms is sprayed 
on rice cultivars carrying the appropriate R gene. Upon 

30 pathogen attack, two levels of protection can occur: (1) 
the gene product produced by the recombinant epiphytes 
triggers an interaction on the plant surface that 
prevents further penetration by the pathogen (e.g., the 
fungal conidia develop appresoria, but do not develop 

35 penetration pegs) ; or (2) the gene product produced by 
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the recombinant epiphytes is carried into the plant 
tissue at the wound site, where it interacts with the 
corresponding R gene product and induces an internal 
disease defense response. Thus, this pre -treatment 
5 confers resistance to Maganporthe isolates (and, 

presumably, other plant pathogens) which normally are 
virulent on those cultivars. This embodiment is 
described in greater detail in Example 3 . 

In connection with the use of epiphytic 

10 bacteria, it should be noted that bacterial and phage 
expression and delivery systems, such as those 
commercially available from InVitrogen, will be 
particularly useful . The bacterial system expresses a 
protein hybrid with pilin, such that the foreign protein 

15 is exposed on the exterior of the bacterium. The phage 
system also expresses a hybrid protein with coat 
component and exterior exposure of the foreign protein. 

The AVR1-C039 gene also may be used as a tool 
to identify and isolate its corresponding R gene. Thus, 

2 0 in a manner similar to that described for isolation of 

the tomato CF-9 gene for resistance to Cladosporium 
fulvum (Jones et al . , Science 266 : 789-793, 1994), the R 
gene in rice that corresponds to AVR1-C039 can be 
isolated by transposon tagging: (1) AVR1 -C039 is 
25 transformed into, and const itutively expressed in a 

susceptible rice line; (2) the transgenic line is crossed 
with a resistant line that carries an identifiable 
transposon; (2) seedlings of Fl progeny constitut ively 
expressing both the Avr gene and the corresponding R gene 

3 0 should die, thereby enabling a simple screening for live 

Fl progeny; (4) any live Fl progeny should be surviving 
by virtue of interruption of either the AVR1-C039 
transgene or the corresponding R gene, presumably by the 
transposon. The transposon, along with the gene it has 
35 interrupted, can thus be isolated. 



2 . AVR1-C03 9 Proteins and Antibodies 



Purified gene products of AVR1-C039 , or 
fragments thereof, may be used to produce polyclonal or 
monoclonal antibodies, which also may serve as sensitive 
5 detection reagents for the presence and accumulation of 
AVR1-C03 9 polypeptides in transformed microbial 
epiphytes, transgenic plants, or other biological 
materials. Polyclonal or monoclonal antibodies 
immunologically specific for AVR1-C039 polypeptides may 

10 be used in a variety of assays designed to detect and 
quantitate the proteins. Such assays include, but are 
not limited to: (1) flow cytometric analysis; (2) 
immunochemical localization of expressed proteins in 
cells or tissues; and (3) immunoblot analysis (e.g., dot 

15 blot, Western blot) of extracts from various cells and 
tissues. Additionally, as described above, antibodies 
can be used for purification of AVR1-C039 polypeptides 
(e.g., affinity column purification, 
immunoprecipitation) . 

20 In a preferred embodiment, purified AVR1-C039 

polypeptides (most preferably from 0RF3) are used as a 
pre-treatment or co-treatment to confer broad- spectrum 
pathogen resistance to rice cultivars carrying the C03 9 R 
gene. Thus, in a manner similar to the above-described 

25 use of AVR1 -C039- expressing epiphytic microorganisms, a 
solution of the peptide is applied to the plants, and 
subsequent or concurrent wounding or inoculation with a 
pathogenic microorganism brings the peptide into contact 
with the R gene product, thereby stimulating a defense 

3 0 response. The inventors have experimentally demonstrated 
the feasibility of this approach, as described in detail 
in Example 4 . 



35 
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The following specific examples are provided to 
illustrate embodiments of the invention. They are not 
intended to limit the scope of the invention in any way. 



5 EXAMPLE 1 

Cloning and Analysis of AVR1-CQ39 

A chromosomal segment putatively containing the 
cultivar specificity gene, AVR1-C039, was isolated from 
10 M. grisea strain 253 9 using a map-based cloning approach, 
followed by chromosome walking (Leong et al . , 1996) . In 
this example we describe the identification, cloning and 
analysis of the AVR1-C039 gene. 



15 Methods ; 

Hybridization protocol. Hybridization methods 
were modified from Amasino (1986) "Acceleration of 
Nucleic Acid Hybridization Rate by Polyethylene Glycol", 
Analytical Biochemistry 152:304-307. The hybridization 

2 0 buffer was prepared according the Amasino protocol, but 

without the PEG and NaCl and with reduced concentrations 
of NaHP04 : 0.12 5M NaHP04 , 7% SDS, 5 0% formamide, 1.0 tnM 
EDTA, pH 7.2. High stringency conditions were used (42 
°C, 16 h) . Post hybridization washes were: one rinse 
25 with 2XSSC at room temperature; one wash in 2XSSC for 10 
min at 65 °C; one wash in 2XSSC, 15 min at 65 °C; one 
final wash in 0.1 X SSC, 0.1% SDS for 15 min at 65 °C. 
The final washing conditions were of greater stringency 
than were the hybridization conditions, giving a T m of 68. 

3 0 Thus, greater than 95% homology would be required to 

maintain a hybrid. None of the post hybridization 
phosphate- containing buffers described in Amasino (1986) 
were employed . 

Chromosome walking strategy. A total genomic 
35 DNA library of M. grisea strain 2539 consisting of 5,194 
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clones was constructed in cosmid vector pMLFl (Leong et 
al . , 1994, supra) and pMLF2 (An et al . , Gene 176 : 93-96, 
1996) . Clones were templated individually as colony 
blots as well as in pools in which the DNA was 
5 restriction digested, electrophoresed and blotted. The 
latter blots were used initially to identify candidate 
pools containing hybridizing clones. Colony blots 
derived from these pools were then screened. Steps were 
performed using endclones prepared from the insert DNA by 

10 digesting the cosmid clones with Apal , which does not 
digest the vector, and recirculari zing the plasmid by 
ligation. This procedure results in a derivative 
containing DNA from each end of the insert (An et al . , 
1996 supra) . Liberation of both ends of the insert from 

15 the vector was achieved by digesting with Apal and No tl . 
The required endclone was then identified by virtue of 
its inability to hybridize with the previous cosmid in 
the walk. 

Transformation of virulent strain Guyll with 

2 0 cosmids within the AVR1-C039 locus: Cosmids from within 

the genetic interval containing AVR1-C039 were introduced 
into Guyll using the transformation protocol described in 
LEUNG et al . (1990) . The procedure was modified as 
follows: After the protoplasts were incubated in 
25 complete medium (CM) +sorbitol , they were poured into 100 
ml molten (45°C) CM+2 0% sucrose agar. The agar was then 
poured into four petri plates. When the agar had 
solidified (1 h) it was overlaid with 15 ml of 1.5% water 
agar containing 800 jug/ml hygromycin B (300 //g/ml final 

3 0 concentration) . 

Creation of frameshift mutations in open 
reading frames at the AVR1-C039 gene locus. 1.05 kb 
fragment was cloned into pBSKS II + (pBSC03 9) to make the 
first two clones. The mutant 1.05 kb fragments were then 
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cloned into pCB1004 a hyg r vector from J. Sweigard 
(Dupont) . Plasmids were linearized by No tl digestion and 
transformed into Guyll protoplasts. 

Initial frameshift mutations were created in 
5 ORFs 2 and 3 by digestion and religation as follows: 
Frameshift in 0RF2 : The AccI site at 
nucleotide (nt) position 499 was cut and the 2 nt 3 ' 
overhangs were trimmed off with T4 DNA polymerase. The 
site was then religated resulting in the removal of 4 bp 

10 or a net frameshift of -2. The nucleotide sequence 

changed from 5' CTAGACAGTCTACCTCTCTGCCA 3' (SEQ ID NO: 9) 
to 5' C T AGAC AGTACC T C T C T G C C A 3' (SEQ ID NO: 10). 

Frameshift in ORFS 2 & 3 : The PflMl site at nt 
position 641 was cut and the 3 nt 3 ' overhang was trimmed 

15 off with T4 polymerase. A klenow-f illed Hindi I I fragment 
containing the streptomycin resistance gene cassette from 
pHP45Q (Prentki and Kritsch, Gene 29: 303, 1984) was 
ligated to the flush-ended PflMI fragment. The conserved 
Hindi I I site was then digested and religated. The net 

20 effect was the substitution of the 3 nt in the PflMI site 
with 4 nt from the Hindi I I site. This created a net 
frameshift of +1. The nucleotide sequence change was 
from 5* CCAGCAGCCAATGCTTGGAAAGATTG 3' (SEQ ID NO: 11) to 
5' CCAGCAGCCAAAGCTTTGGAAAGATTG 3' (SEQ ID NO: 12). 

25 In the AAccI construct, the peptide retains 

only 19 aa of its original sequence and is truncated 
after 3 6 aa . The native 0RF2 peptide is 77 aa . In the 
APfl construct, the ORF2 peptide sequence is almost 
unchanged except for the terminal 10 aa and the resulting 

3 0 peptide is 17 aa longer. 0RF3 , on the other hand, 

retains only 2 0 aa from its N- terminal and terminates 
after 31 aa . 

The frameshift in 0RF1 was created by "Quick 
Change" site-directed mutagenesis (Stratagene) using 
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primers designed to introduce an extra G nucleotide after 
the ATG: 

PI: CAACGTACTAGAAATGGAGTAATAAGTACC (SEQ ID NO: 13) 
P2: GGTACTTATTAGTCCATTTCTAGTACGTTG (SEQ ID NO: 14) 
5 The mutagenesis basically abolished the ORF 

completely . 

Creation of ATG mutations in open reading 
frames at the AVR1-C039 gene locus: The 1.05 kb fragment 
containing AVR1-C039 was cloned into pCB10 04 and 
10 Quickchange mutagenesis (Stratagene) was used to make the 
following mutant constructs: 

AORF1 (ATG- >TTT) : Start codon of ORF1 was eliminated by 
quick change mutagenesis using a primer with the mutant 
ATG sequence . 

15 AORF3 (ATG- >TTT) : Start codon of ORF3 was eliminated by 
quick change mutagenesis using a primer with the mutant 
ATG sequence . 

The following clone was made but the mutant 
allele has not yet been tested by transformation to 

2 0 determine the phenotype . 

AORF2 (ATG- >TTT) : Start codon of ORF2 was eliminated by 
quick change mutagenesis using a primer with the mutant 
ATG sequence . 

25 Results : 

As mentioned above, a gene conferring cultivar- 
specific interactions with rice cultivar C039 was 
isolated from M. grisea strain 2539 using a map-based 
cloning approach, followed by a 2 0 -step chromosome walk. 

3 0 The AVR1-C039 locus was delimited to a 1.0 5 kb region by 

subcloning and transformation of Guyll, a strain normally 
virulent on C03 9, to avirulence. The nucleotide sequence 
of this 1.05 kb region of Chromosome 1 is set forth below 



as SEQ ID N0:1: 



SEQ ID NO: 1 (5' - 3") 

GATCTGTAAA TTACATATAT 

5 

TCTTTTTAAA ATTTTAACGA 

ACAAATTAAC AATTCGTCCA 

10 TTTTTAATCC ATTTTTAAAA 

AGGGTACCCG GCTAGCATAG 

GATCTATACT ACATTCGGCG 
15 T5 

CACACCCTAA ATCATGCTGC 
ORF1- 

AC TAGAAATG ACTAATAAGT 
-ORF5 

2 0 TATTTAACGA AGCGTCCATT 
AGTGTTCTCC CTCCTCTGGA 

CTACCTCTCT GCCACCATTT 

2 5 T4 

ACTGAAACTT CTACAATTGA 

TCGCCACCCT TATCGCCCTT 

3 0 AAAGATTGCA TCATCCAACG 

TACTGCCAAT AGGAACGAAG 

ORF6- -ORF4 

TTAATGAGGC CTGCCATCCC 

35 

CTTTCTGGCG ATTTTACATC 
T6 

ATCAATAAGA GTCCAGGTTG 

4 0 ATTTATCTCG GAATAGCAGC 

GCGATAGGGA TATTAGAAGC 

GTCCTTGTCT ATGGATTCGC 

45 



TTATTTTGCC GCATTTTGCT AACCGCCTAT 

TTAAGAACGC AATTCAATTT TGCGTTCTAC 

AAAGAGGTAT TTAAGCGAAG ATTTGGCATT 

AAATACATCT GCTTTAACCC ACCTTTGCCA 

CCTTGGTTAC CAAAAACGGC TAAAGCTGTC 

CTCTGAACAA CTAAGCAACA GCGAGGAGAT 

TAGTAATGCG ATATAATGGC CAAACAACGT 

ACCCAGTCAA GTCAACTTGC TGTAGTATTA 
ORF2- 

TACTGCCAGG GCAAGTTTAT CAATGGGACC 
Tl 

CAACTCAGTT CTTTGCAAAC GCTAGACAGT 

TTACTTTTCA AAAATTTACT CCTTGCCGCT 
ORF3- 

AAGAGCCCAC AATGAAAGTC CAAGCTACAT 

GCGGCTTACT TTCCAGCAGC CAATGCTTGG 
T2 

TTATAAAGAC GGCGATGTCA ACAACATATA 

AGATAACTAT TGAGGAATAT AAAGTCTTCG 

TACCCAGTTA TACTTCCCGA CAGATCGGTC 

T3 

AGCTTACGCT GACGACGATG AGTCTTGTTG 
ORF7- 

AAAAATTCGC CACCATGGTA ATAGAGGGTT 
CGTGTGTGCA ATTATCACGG CTGTTCCTCT 
AGGACAAATT TACGGCAATA GCAACCAATT 
CCGTCGAATG GAGGCGACGG CGGATCC 
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DNA sequence analysis revealed four small open 



reading frames of 45, 77, 89 and G9 amino acids in length 
(0RF1, ORF2, ORF3, ORF4 , respectively, as shown on SEQ ID 
No. 1 above) . The amino acid sequences encoded by the 
5 four open reading frames are set forth below as Sequence 
ID No's. 2, 3, 4 and 5, respectively. Three other open 
reading frames were also identified (ORFS 5, 6, and 7, 
set forth below as SEQ ID NOS : 6, 7 and 8, respectively. 

10 AVR1-C039 ORF1 (SEQ ID NO: 2) 

MTNKYPVKST CCSIIFNEAS IYCQGKFING TSVLPPLDNS 
VLCKR 

AVR1-C039 ORF2 (SEQ ID NO: 3) 

15 MGPVFSLLWT TQFFANARQS TSLPPFLLFK NLLLAATETS 



TIERAHNESP SYIRHPYRPC GLLSSSQCLE RLHHPTL 



AVR1-C039 ORF3 (SEQ ID NO: 4) 



MKVQATFATL IALAAYFPAA NAWKDCI IQR YKDGDVNNIY 



20 



TANRNEEITI EEYKVFVNEA CHPYPVILPD RSVLSGDFTS 



AYADDDESC 



AVR1-C039 ORF4 (SEQ ID NO: 5) 



MAGLINEDFI FLNSYLFVPI GSIYWDIAV FITLDDAIFP 



25 



SIGCWKVSRK GDKGGECSLD FHCGLFQL 



AVR1-C039 ORF5 (SEQ ID NO: 6) 



MDASLNI ILQ QVDLTGYLLV ISSTLFGHYI ALLAA 



30 



AVR1-C039 ORF6 (SEQ ID NO: 7) 



MRPAI PTQLY FPTDRSFLAI LHQLTLTTMS LVDQ 



AVR1-C039 ORF7 (SEQ ID NO: 8) 
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MVIEGYLSRN SSRVCNYHGC SSAIGILEAG QIYGNSNQLS 
LSMDSPVEWR RRRI {continues beyond cloned DNA) 



The sequence surrounding the ATG of 0RF3 
5 matched four out of five of the conserved bases found in 
fungal translation start sites and contained a 
hydrophobic amino terminus punctuated by a lysine in 
position 2, and two putative cleavage sites for removal 
of the signal peptide. A fourth open reading frame 
10 (ORF4) was identified on the opposite strand. However, 
the sequence surrounding that ATG contained only two 
matches with the fungal translation start site consensus 
sequence . 

Site-directed mutations in 0RF1, ORF2 and ORF3 
15 were created in order to assess the roles of these ORFS 

in conferring avirulence . The translation start codon of 
each ORF was converted from ATG to TTT . In ORFs 1 and 3, 
these mutations led to loss of avirulence. Frameshift 
mutations in ORF1 and ORF3 also led to a loss of 
2 0 avirulence, while the frameshift mutation of ORF 2 did 

not. Taken together, these data indicate a role for ORF1 
and ORF3 in conferring avirulence to M. grisea strain 
2539 on rice cultivar C039. 

The absence of a splice site and a lariat 
2 5 sequence, as well as any putative TATA element 

immediately upstream of the ATG of ORF1 may indicate that 
ORF1 overlaps sequences critical to the promotion of 
transcription of AVR1-CQ39 . 



EXAMPLE 2 
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Distribution of AVR1-C039 Homologs 
in Diverse Isolates of Macrnaporthe crrisea 

The distribution of AVR1-C03 9 homologs was 
5 investigated by probing a large sample of host-specific 

forms of M. grisea with a segment of AVR1-C039 DNA, using 
hybridization conditions such as those described in 
Example 1. The results of this survey indicate that 
isolates infecting rice, Digitaria and wheat largely lack 

10 homologs of Avr-C039 . However, homologs of the gene were 
commonly found in Elutine (Setaria) - infecting isolates. 
Moreover, a detailed analysis of the AVR1-C039 locus from 
virulent rice isolate Guy 11 indicated that at least 20 
Kb of DNA corresponding to and containing the AVR1-C03 9 

15 locus of isolate 2539 was absent. 

EXAMPLE 3 

Improved Resistance to M. grisea Infection 

2 0 in Rice Plants Sprayed with Bacterial 

Epiphytes expressing ORF 3 of AVR1-CQ39 

The ORF3 of AVR1-C039 described in Example 1 
was transferred into a pET expression vector in 
25 Escherichia coli . A suspension containing the 

transformed E. coli was sprayed onto leaves of rice 
plants carrying the corresponding R gene for AVR1-C039. 
The plants were then inoculated with M. grisea isolate 
Guy 11, which is a virulent strain on the plant cultivars 

3 0 tested. As a control, plants were sprayed with an E. 

coli suspension which did not contain the ORF- 3 encoding 
plasmid, then inoculated with isolate Guy 11. 

Inoculated plants pre-treated with the ORF3 - 
expressing E. coli displayed reduced lesion size and 
3 5 number as compared to inoculated control plants pre- 
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treated with E. coli lacking the 0RF3 -expressing plasmid. 
These data support the role of 0RF3 in conferring 
avirulence in M. grisea. 

EXAMPLE 4 

Improved Resistance to M. grisea Infection 
in Rice Plants Sprayed with Protein Encoded by 
ORF 3 of AVR1-CQ39 

The 0RF3 of AVR1-C039 described in Example 1 
was transferred into a pET expression vector in 
Escherichia, coli. Protein extracts from IPTG- induced E. 
coli cells carrying either the pET vector alone (control) 
or the pET-0RF3 construct were tested for their effects 
on virulence. The cellular protein extracts were 
concentrated by ammonium sulfate precipitation. Cultivar 
C03 9 was inoculated with virulent M. grisea strain Guyll 
in combination with the concentrated protein extract to 
give 5 x 10 s conidia and 20 mg total protein extract in 10 
ml sterile water. 

Inoculated plants co- treated with the ORF3 - 
containing protein extract displayed reduced lesion size 
and number as compared to inoculated control plants co- 
treated with protein extract lacking ORF3 . These data 
further support the role of ORF3 in conferring avirulence 
in M. grisea. 

While certain of the preferred embodiments of 
the present invention have been described and 
30 specifically exemplified above, it is not intended that 
the invention be limited to such embodiments. Various 
modifications may be made thereto without departing from 
the scope and spirit of the present invention, as set 
forth in the following claims. 



10 
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SEQUENCE LISTING 



<110> Sally A. Leong 
Mark L . Farman 

<120> Cultivar Specificity Gene from the Rice 
Pathogen Ma.gna.por the grisea, and Methods of Use 



<130> WARF P98067WO 



<150> US 60/075,925 
<151> 1998-02-25 



<170> FastSEQ for Windows Version 3.0 

<210> 1 
<211> 1047 
<212> DNA 

<213> Magnaporthe grisea 
<400> 1 

gatctgtaaa ttacatatat ttattttgcc gcattttgct aaccgcctat tctttttaaa 60 

attttaacga ttaagaacgc aattcaattt tgcgttctac acaaattaac aattcgtcca 12 0 

aaagaggtat ttaagcgaag atttggcatt tttttaatcc atttttaaaa aaatacatct 180 

gctttaaccc acctttgcca agggtacccg gctagcatag ccttggttac caaaaacggc 240 

taaagctgtc gatctatact acattcggcg ctctgaacaa ctaagcaaca gcgaggagat 3 00 

cacaccctaa atcatgctgc tagtaatgcg atataatggc caaacaacgt actagaaatg 360 

actaataagt acccagtcaa gtcaacttgc tgtagtatta tatttaacga agcgtccatt 42 0 

tactgccagg gcaagtttat caatgggacc agtgttctcc ctcctctgga caactcagtt 480 

ctttgcaaac gctagacagt ctacctctct gccaccattt ttacttttca aaaatttact 540 

ccttgccgct actgaaactt ctacaattga aagagcccac aatgaaagtc caagctacat 600 

tcgccaccct tatcgccctt gcggcttact ttccagcagc caatgcttgg aaagattgca 660 

tcatccaacg ttataaagac ggcgatgtca acaacatata tactgccaat aggaacgaag 72 0 

agataactat tgaggaatat aaagtcttcg ttaatgaggc ctgccatccc tacccagtta 780 

tacttcccga cagatcggtc ctttctggcg attttacatc agcttacgct gacgacgatg 840 

agtcttgttg atcaataaga gtccaggttg aaaaattcgc caccatggta atagagggtt 90 0 

atttatctcg gaatagcagc cgtgtgtgca attatcacgg ctgttcctct gcgataggga 960 

tattagaagc aggacaaatt tacggcaata gcaaccaatt gtccttgtct atggattcgc 1020 

ccgtcgaatg gaggcgacgg cggatcc 1047 

<210> 2 
<211> 45 
<212> PRT 

<213> Magnaporthe grisea 
<400> 2 

Met Thr Asn Lys Tyr Pro Val Lys Ser Thr Cys Cys Ser lie lie Phe 

15 10 15 

Asn Glu Ala Ser lie Tyr Cys Gin Gly Lys Phe lie Asn Gly Thr Ser 

20 25 30 

Val Leu Pro Pro Leu Asp Asn Ser Val Leu Cys Lys Arg 



- 33 - 

<211> 77 
<212> PRT 

<213> Magnaporthe grisea 



<400> 3 

Met Gly Pro Val Phe Ser Leu Leu Trp Thr Thr Gin Phe Phe Ala Asn 

15 10 15 

Ala Arg Gin Ser Thr Ser Leu Pro Pro Phe Leu Leu Phe Lys Asn Leu 

20 25 30 

Leu Leu Ala Ala Thr Glu Thr Ser Thr lie Glu Arg Ala His Asn Glu 

35 40 45 

Ser Pro Ser Tyr lie Arg His Pro Tyr Arg Pro Cys Gly Leu Leu Ser 

50 55 60 

Ser Ser Gin Cys Leu Glu Arg Leu His His Pro Thr Leu 
65 70 75 



<210> 4 

<211> 89 

<212> PRT 

<213> Magnaporthe grisea 



<400> 4 

Met Lys Val Gin Ala Thr Phe Ala Thr Leu lie Ala Leu Ala Ala Tyr 

15 10 15 

Phe Pro Ala Ala Asn Ala Trp Lys Asp Cys lie lie Gin Arg Tyr Lys 

20 25 30 

Asp Gly Asp Val Asn Asn lie Tyr Thr Ala Asn Arg Asn Glu Glu lie 

35 40 45 

Thr lie Glu Glu Tyr Lys Val Phe Val Asn Glu Ala Cys His Pro Tyr 

50 55 60 

Pro Val lie Leu Pro Asp Arg Ser Val Leu Ser Gly Asp Phe Thr Ser 
65 70 75 80 

Ala Tyr Ala Asp Asp Asp Glu Ser Cys 



<210> 5 
<211> 68 
<212> PRT 

<213> Magnaporthe grisea 



<400> 5 
Met Ala Gly Leu 
1 

Phe Val Pro lie 
20 

Thr Leu Asp Asp 
35 

Arg Lys Gly Asp 
50 

Leu Phe Gin Leu 
65 



lie Asn Glu Asp 
5 

Gly Ser lie Tyr 

Ala lie Phe Pro 
40 

Lys Gly Gly Glu 
55 



Phe lie Phe Leu 
10 

Val Val Asp lie 
25 

Ser lie Gly Cys 

Cys Ser Leu Asp 
60 



Asn Ser Tyr Leu 
15 

Ala Val Phe lie 
30 

Trp Lys Val Ser 
45 

Phe His Cys Gly 



<210> 6 

<211> 35 

<212> PRT 

<213> Magnaporthe grisea 



:400> 6 



- 34 - 



Met Asp Ala Ser Leu Asn lie lie Leu Gin Gin Val Asp Leu Thr Gly 

15 10 15 

Tyr Leu Leu Val lie Ser Ser Thr Leu Phe Gly His Tyr lie Ala Leu 
20 25 30 

Leu Ala Ala 



<210> 7 
<211> 34 
<212> PRT 

<213> Magnaporthe grisea 



<400> 7 

Met Arg Pro Ala lie Pro Thr Gin 

1 5 
Phe Leu Ala lie Leu His Gin Leu 
20 

Asp Gin 



Leu Tyr Phe Pro Thr Asp Arg Ser 

10 15 
Thr Leu Thr Thr Met Ser Leu Val 
25 30 



<210> 8 
<211> 54 
<212> PRT 

<213> Magnaporthe grisea 



Met Val lie Glu Gly Tyr Leu Ser Arg Asn Ser Ser Arg Val Cys Asn 

15 10 15 

Tyr His Gly Cys Ser Ser Ala lie Gly lie Leu Glu Ala Gly Gin lie 

20 25 30 

Tyr Gly Asn Ser Asn Gin Leu Ser Leu Ser Met Asp Ser Pro Val Glu 

35 40 45 

Trp Arg Arg Arg Arg lie 



<210> 9 
<211> 23 
<212> DNA 

<213> Magnaporthe grisea 



<400> 9 
ctagacagtc tacctctctg cca 

<210> 10 
<211> 21 
<212> DNA 

<213> Magnaporthe grisea 

<400> 10 
ctagacagta cctctctgcc a 

<210> 11 
<211> 26 
<212> DNA 

<213> Magnaporthe grisea 

<400> 11 
ccagcagcca atgcttggaa agattg 
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<210> 12 
<211> 27 
<212> DNA 

<213> Magnaporthe grisea 

<400> 12 
ccagcagcca aagctttgga aagattg 

<210> 13 
<211> 30 
<212> DNA 

<213> Magnaporthe grisea 
<400> 13 

caacgtacta gaaatggagt aataagtacc 

<210> 14 
<211> 30 
<212> DNA 

<213> Magnaporthe grisea 



<400> 14 

ggtacttatt agtccatttc tagtacgttg 



We claim: 



1. An isolated nucleic acid molecule from 
Magnaporthe grisea that confers rice cultivar C03 9- 

5 specific avirulence to fungal plant pathogens that 
contain the nucleic acid. 

2. The nucleic acid molecule of claim 1, which 
is AVR1-C03 9. 

10 

3. The nucleic acid molecule of claim 2, 
having a sequence comprising part or all of SEQ ID NO : 1 . 

4. The nucleic acid molecule of claim 1, which 
15 encodes a polypeptide having the features of a 

polypeptide comprising a sequence selected from the group 
consisting of SEQ ID NO : 2 , SEQ ID NO : 3 , SEQ ID NO : 4 , SEQ 
ID NO: 5, SEQ ID NO : 6 , SEQ ID NO : 7 and SEQ ID NO : 8 . 

20 5. The nucleic acid molecule of claim 4, which 

encodes a polypeptide having a sequence selected from the 
group consisting of SEQ ID NO : 2 , SEQ ID NO : 3 , SEQ ID 
NO:4, SEQ ID NO : 5 , SEQ ID NO : 6 , SEQ ID NO : 7 and SEQ ID 
NO: 8 . 

25 

6. A recombinant DNA molecule comprising the 
nucleic acid molecule of claim 1, operably linked to a 
vector for transforming cells. 

30 7. A cell transformed with the recombinant DNA 

molecule of claim 6 . 



8. The cell of claim 7, selected from the 
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group consisting of bacterial cells, fungal cells, insect 
cells and plant cells. 



9 . 



The transformed cell of claim 8, which is 



5 



an epiphytic bacterial cell. 



10. A transgenic plant regenerated from the 



transformed cell of claim 8 . 



10 



11. An isolated nucleic acid molecule having a 
sequence selected from the group consisting of: 

a) part or all of SEQ ID NO : 1 ; 

b) an allelic variant of part or all of SEQ 



ID NO:l; 



15 



c) a natural mutant of part or all of SEQ 



ID N0:1; 



d) a sequence hybridizing with part or all 



of SEQ ID NO : 1 or its complement and encoding a 
polypeptide substantially the same as any of the 

2 0 polypeptides encoded by SEQ ID NO : 1 ; and 

e) a sequence encoding part or all of a 
polypeptide having an amino acid sequence selected from 
the group consisting of SEQ ID NO : 2 SEQ ID NO : 3 , SEQ ID 
NO:4, SEQ ID NO : 5 , SEQ ID NO : 6 , SEQ ID NO : 7 and SEQ ID 

2 5 NO : 8 . 

12. An oligonucleotide between about 10 and 
about 100 nucleotides in length, which specifically 
hybridizes with a portion of the nucleic acid molecule of 

30 claim 11. 

13. A recombinant DNA molecule comprising the 
nucleic acid molecule of claim 11, operably linked to a 
vector for transforming cells. 

35 
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14. A cell transformed with the recombinant DNA 
molecule of claim 13 . 



15. The cell of claim 14, selected from the 

5 group consisting of bacterial cells, yeast cells and plant 
cells . 

16. The cell of claim 15, which is an epiphytic 
bacterial cell . 

10 

17. A transgenic plant regenerated from the 
cell of claim 15. 

18. A polypeptide encoded by the nucleic acid 
15 molecule of claim 11. 

19. Antibodies immunologically specific for the 
polypeptide of claim 18. 

20 20. A protein encoded by an isolated nucleic 

acid molecule from Magnaporthe grisea that confers rice 
cultivar C03 9-specif ic avirulence to fungal plant 
pathogens that contain the nucleic acid. 

25 21. The protein of claim 20, encoded by an 

AVR1-C039 gene. 

22. The protein of claim 21, which is encoded 
by ORF 3 of AVR1-C039. 

30 

23. The protein of claim 20, having an amino 
acid sequence selected from the group consisting of SEQ ID 
NO: 2, SEQ ID NO : 3 , SEQ ID NO : 4 , SEQ ID NO : 5 , SEQ ID NO : 6 , 
SEQ ID NO: 7 and SEQ ID NO : 8 . 

i 
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24. Antibodies immunologically specific for the 
protein of claim 20. 

25. A transgenic epiphytic bacterium that 

5 expresses a portion of an AVR1-C039 gene effective to 
confer rice cultivar C039-specif ic avirulence to 
microorganisms expressing the gene. 

26. The transgenic epiphytic bacterium of claim 
10 24, which expresses ORF3 of SEQ ID NO : 1 , or a functional 

equivalent . 

27. A method of enhancing the scope of 
resistance of rice cultivar C039 plants to pathogenic 

15 microorganisms, which comprises treating the plants with a 
polypeptides produced by expression of AVR1-C039 , in an 
amount effective to trigger expression of a C03 9- specif ic 
R gene in the plants. 

2 0 28. The method of claim 27, comprising treating 

the plants with a solution comprising the polypeptides. 

29. The method of claim 27, which comprises 
treating the plants with an epiphytic bacterium that 
25 expresses a portion of an AVR1-C039 gene that produces the 
polypeptides effective to trigger expression of the C039- 
specific R gene in the plants. 



ABSTRACT 

This invention provides a novel avirulence gene 
from the rice blast pathogen, Magnaporthe grisea. The 
gene, AVR1-C039, confers cultivar-specif ic avirulence to 
strains of M. grisea that carry the gene. Also disclosed 
are methods of using the gene and its encoded products for 
improving resistance of rice to the rice blast pathogen. 



